UJ 




ED2900A 



INTRODUCTION TO 



DESIGNING WITH THE 

1 Am2900 FAMILY OF 

a 

lU 



MICROPROGRAMMABLE 



S BIPOLAR DEVICES 



I- 
0) 
3 
O 



LECTURE 
VOLUME I 



ED2900A 



INTRODUCTION TO DESIGNING WITH THE Am2900 FAMILY 
OF MICROPROGRAMMING BIPOLAR DEVICES 

Volume I 



3rd Edition 



January 1985 

Advanced Micro Devices, Inc. 

Customer Education Center 



ADVANCED MICRO DEVICES ^ 



Volume I 
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INTRODUCTION TO DESIGNING WITH THE ftn2900 FAMILY 
OF MICROPROGAMMING BIPOLAR DEVICES 



EDUCATIONAL OBJECTIVES 



1. Understanding the digital-computer, machine-instruction 
sequencing process (macro level) and associated architecture 
at the lower level (micro level). 

2. Appreciation of digital -computer control-unit organization 
for machine-instruction sequencing and its Implementation 
with Am2900 family devices. 

3. Appreciation of digital-computer, arithmetic/logic unit (ALU) 
organization and its implementation with Am2900 family devices. 

4. Understanding microprograrmiing terms (mnemonic programming 
at the micro level). 

5. Understanding Am2900 family support devices for constructing 
an instruction sequencing system at the micro level. 
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INTRODUCTION 



WELCOME TO THE WORLD OF MICR0PR0GAMMIN6 AND THE flin2900 FAMILY 
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Ea2900A EDUCATIONAL GOALS 



" Introduction to the Advanced Micro Devices (AMD) Ain2900 family 
of devices and their use. " 



DAY 1 

• Introduction to bit-slice architecture, microprogramming, 
microprogram sequencers (controllers) and their use. 



DAY 2 

• Study of arithmetic/logic units (ALUs), their use (algorithms) 
and interfacing to sequencers 



DAY 3 

t Analysis of support chips for systems support and specialized 
applications: 

Devices for dealing with interrupts 

Register expansion for ALU's 

Registered PROMs for ALU's 

Shift and status control devices 

Microprogrammable clocks 

16-bit and 32-bit ALUs 
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TECHNOLOGY TRENDS 



EARLY 
1960s 



LATE 
1960s 



SMALL SCALE INTEGRATION (SSI). 2-10 GATES PER CHIP. 

NAND gates 

NOR gates 

XOR gates 

NOT gates (inverters) 

Individual flip-flops (storage) 

256-bit RAM 

MEDIUM SCALE INTEGRATION (MSI), 20-100 GATES PER CHIP, 

Registers/Latches 
Decoders/Encoders 
Multiplexers 
Adders/ Comparators 
Arithmetic/Logic Units 
IK-bit RAM 



1970s LARGE SCALE INTEGRATION (LSI), 200-500 GATES PER CHIP. 

RALU-Arithmetic/Logic Unit (ALU) with registers 
Interrupt controller/Direct Memory Access controller 
Microprogram sequencer/Macro program controller 
Memory control! er/Input-Output controller 
Microprocessors 
16K-bit RAM 



1980s 



VERY LARGE SCALE INTEGRATION (VLSI), MORE THAN 1000 GATES, 
16-bit Bipolar and MOS ALUs 
16 and 32-bit Bipolar and MOS microprocessors 
Multi-mode arithmetic on expandable RALUs 
Special Data Manipulation (FFT, Signal processing, ooo) 
256K-bit RAM 
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Example of Bipolar Speed/Density Iin)rov«Bents 



Am2901 FOUR-BIT MICROPROCESSOR SLICE 



540 GATES 
SOOmW 
40-PIN DIP 
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DIE 
SIZE 



SPEED 
A, B G, P 



Am2g01 
33,000 MILS2 



80ns 



TECHNOLOGY 



LOW-POWER 
SCHOTTKY 



Am2901A 
20,000 MILS2 



65ns 



DUAL LAYER 
METAL ION- 
IMPLANTATION 



1975 



1977 



Am2901B 
15,000 MILS2 



50ns 



PROJECTION 
PRINTING 



1978 



Am2901C 
15,000 MILS2 



37ns 



ECL INTERNAL 

TTL I/O 

IMOX 



1981 
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TECHNOLOGY TYPES 

(see Am2900 Family Data Book and Figure on next page) 

" OF WHAT IS THE ACTUAL SEMICONDUCTOR CHIP MADE? " 

• BIPOLAR - Earliest technology 

Fastest technology 
Transistor-Transistor Logic (TTL) 
Emitter-coupled Logic (ECL) 
Ion-implanted oxide-isolation (IMOX) 

TTL external/ECL internal 
IMOX used in Am2900 family 

• MOS - Developed later than Bipolar 

Higher chip density 

Slower speed relative to Bipolar Technology 

Used in microprocessor chips (e.g. Intel 80286, Z8000) 

PROBLEM 

How do you build a large circuit (e.g. a microprocessor) 
with bipolar speed if it won't fit on a single chip? 

SOLUTION 

Use a bit-slice architecture! 
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10,000 r- 



1.000 
GATES 



CHIP 



100 



10 



Bipolar Density Improvenients 



Am2900 Bipolar LSI/VLSI 



SSI. MSI FAMILIES 

T ^ GOLD DOPED 

S = SCHOTTKY 

L = LOW-POWER SCHOTTKY 

A = ADVANCED SCHOTTKY 



Am291l6 
(2b00 GATES) 

Am2903 ^^ fcl 

(630 GATES) ^^f"""^ 
•^n Am2910 
(736 GATES) 




1— J 



1965 



1970 



1975 



1980 



'Moore's Law" 



Moore's Law - Gates/chip increases by a factor of four 
approximately every two years. 
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BIT-SLICE ARCHITECTURE 



Since chip density is limited, a small processor chip 
(typically four bits wide) is made in such a way that 
several of these chips can be hooked together as building 
blocks to make a larger (8-, 16-, 24-, 32-, 64-b1t) 
processor. This is defined as bit-slice architecture. 



This hardware implementation requires special features to 
handle problems like carry overflow, sign bit, etc. that 
involve data movement between slices. 



Note that the term microprogramming has not yet been defined. 
Microprogramming and bit-slice are two separate concepts, 
although they are closely related in most of the Am2900 
family. Bit-slice generally refers to the structure of 
various devices and how they are connected. Microprogramning 
concepts involve the method by which these devices and others 
are controlled. 
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16 BIT ADDER/REGISTER 



16-BIT INPUT 




16-BIT OUTPUT 
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THERE ARE THREE BASIC IMPLEMENTATION CHOICES (LEVELS): 



t SSI /MS I Hardware 



• Bit-slice (LSI/VLSI) "Firmware"* 



• MOS Microprocessors (LSI /VLSI) Software 



* "0"s and 'T's stored in a Read-Only Memory (ROM) 
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USE BIT SLICES TO BUILD SYSTEMS 



• MACHINES WITH LONG NORD LENGTHS 

16, 24, 32, 36, 64 bit words and beyond 

• MACHINES WITH SPECIAL MACRO LEVEL INSTRUCTION SETS 

Emulators - such as Nanodata QM~1 
MIL STD 1750 computers 
Controllers 

• FAST MACHINES - 100ns cycle times 

Real-time data control 
Real-time complex arithmetic 
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WORD LENGTH 



FIXED INSTR. 



4,8, or 16 bit 
fixed 



BIT SLICE 

any multiple 
of 4 



SSI /MSI 
any length 



CHIP COUNT FOR SIMPLE SYSTEM 



FIXED INSTR. 



3-6 



BIT SLICE 



30-60 



SSI /MS I 
100-500 



FIXED INSTR. 



pre-determined 



ARCHITECTURE 

BIT SLICE 

largely user 
defined 



SSI/MSI 

completely 
user defined 



FIXED INSTR. 

pre-determined; 
primitive 



INSTRUCTION SET 

BIT SLICE 

user-defined 
in firmware 



SSI/MSI 

user-defined 
firmware/ 
hardware 



FIXED INSTR. 



cheapest: 
use 9080A 
whenever 
possible 



CONCLUSIONS 

BIT SLICE 

use whenever 
high speeds and/or 
unique instructions 
are needed 



SSI /MSI 

fastest: use 
Schottky MSI where 
very high speed 
is a must 
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IF YOU'RE GOING TO BUILD A BIPOLAR MACHINE, YOU SHOULD USE 

LSI 

• LSI reduces costs (less chips and connections) 

• LSI improves reliability (fewer total pins) 



IF YOU'RE GOING TO BUILD A BIPOLAR MACHINE IT SHOULD BE 
MICROPROGRAMMED INSTEAD OF USING HARDWARE LOGIC 
(Mlcroprogramnlng is a level above hardware logic) 



• Easier design, using application-specific variable names 
and operations 

• Easier implementation 

• Easier testing 

• Easier maintenance 

• Better documentation (easier to understand) 



ADVANCED MICRO DEVICES ^ 



1-160 ED2900A 1-160 



MICROPROGRAMMING 



« In order to appreciate the position of the microprogramming level 
(micro level) in systems design consider the ... 



HIERARCHY OF COMPUTER ALGORITHM DESCRIPTIONS/LANGUAGES 

- Higher-order languages (compiler/interpreter translators) 

- Lower-order languages (assembler translators) 

- Machine language (macro level) 

- Register-transfer languages-RTL (microprogramming) 

- Boolean algebra (symbolic logic - state diagrams) 

- Logic levels (timing diagrams - waveforms) 



Note: One can design, implement and test algorithms on any one 
or more of the above levels, the choice depending upon 
application and constraints. Specific languages at each 
level are used to define a desired algorithm as well as 
its implementation. Various design approaches using some 
of the above languages are employed in this course. 
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MICROPROGRAMMING IS A TECHNIQUE FOR 
DESIGNING COMPUTER CONTROL UNITS (CCUs) FOR CONTI«)LLERS 



Instead of defining information movements and manipulations 
In terms of Boolean algebra, they are described on a higher 
symbolic level using register and arithmetic/logic operation 
designations {register transfer language-RTL). With 
Boolean algebra, all hardware operations are described at 
the logic level. RTL permits a more concise description 
of the desired process using names and operations reflective 
of the original design process. 



Initially consider computer control as an example of a 
microprogrammed architecture, i.e. 



Memory IziZI 



Computer 
Control 
Unit 
(CCU) 



JK 






1 1 



Ji/ 



Input/; 

Output: 

Unit ; 
(I/O) ; 



X 






Arithmetic/ I 

Logic j 

Unit I 

(ALU) I 
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MICROPROGRAhflING DEFINITIONS 



Using a register transfer language (i.e. microprogramming) 
to define desired information movements and operations 
permits the system to be developed with a hierarchical 
modular (chip and firmware-RTL) structure. For example, 
ALU bit-slice chips are given a coded CCU command, such as 
ADD Register 2 to Register 1. The ALU bit-slice chips then 
execute the operation internally with the CCU not having to 
control the exact step by step addition process 



Microprogramming then consists of defining in an encoded 
fashion using system variables (registers/variables operations), 
a step-by-step process of information movement and manipulation. 
The mnemonic microprogram is then decoded into zeros and ones 
and is put into a PROM. Each line statement or sequence of 
ones and zeros of the PROM program is sent to the ALU or other 
system chips under clock control for proper sequential execution. 



Microcode 



110100010 

010101011 bit patterns control 

001100111 individual logic gates 

110011000 
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GENERAL MICROPROGRAMMED ARCHITECTURE 



MACROINSTRUCTIONS 









1 






COND 
CODE 
MUX 




f 










cc 

Am2910 

I SEQUENCER 












:lock 






































MICROPROGRAM 
MEMORY 


















PIPELINE 
REGISTER 




DATA/ADDRESSES 
























( 








Am2901A 
ALU 
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REGISTER 
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GENERAL MICROPROGRAHHED SYSTEM 



fnOM DATA BUS 



ii 



INSTRUCTION 
REGISTER 



(PROM OR I'LAI 



I 



MAPPING 
PROM 



OTHER 
ADDRESS ' 
SOURCES ■ 



:> c 



12-3 Am2909A! 
OR Am2911A> 
OR Am2910AI 



12: 



CLOCK 

o 



So 

Si 

FE 

PUP 



CONTROL 

LOGIC 
(PROM.SSII 



V 



\^ 



Iz 



MICROPROGRAM 
MEMORY 
IPROMI 

256 TO 4k WORDS 



^ 



PIPELINE 
REGISTER 



V 



1 Microinstruction currently being executed 

2 Sequencer control lines select source of 
next microinstruction address 

3 Next microinstruction address 

4 Next microinstruction 

5 Status bits from current microinstruction 

6 Status bits from last microinstruction 



STATUS 
REGISTER 
(Am»18l 



FROM DATA BUS 



\7 



TO OTHER 

SYSTEM 

ELEMENTS 

le g ENABLES ON 

MAR IR DRi 



ii 



DATA 
REGISTER 



H 



D 

ALU 



i; 



TO DATA BUS 



'n + 4 
OVR 



® 



L 



PROGRAM 

CONTROL UNIT 

ANDOR MEMORY 

ADDRESS REGISTER 



IT 

TO 

ADDRESS 

BUS 
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TRADITIONAL HARDWIRED CCU: 



FROM MEMORY 



INSTRUCTION REGISTER 



DECODE LOGIC 



±z 



TIMING 
GENERATOR 



TIMING 
CONTROLS 



z. 



COMPLEX 

SEQUENTIAU 

COMBINATIONAL 

NETWORK 



CONTROL 

SIGNALS 

TO SYSTEM 

MICROWORD' 
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tIARDHIRED CCU 

Advantages 

• May be faster solution (execution time) 

• Custom designed for the specific problem 
May be smaller (part count and size) 

Justification 

• Suitable if design is rigid or fixed 
for high volume production 

Di sadvantages 

e Lengthy design time with Boolean algebra descriptions 
(logic equations) 

• Bulky documentation - long parts lists, detailed 
logic schematics, etc. 

o Any changes require partial or total redesign 

Pin count, board space high 

• Board may have very limited modular structure 
(modularity in design layout is difficult) 

Testing difficult - minimization effort is difficult 

• Debug at logic level is more complex than for 
LSI solutions 
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THE SIMPLEST CONTROL UNIT 
ecu - Conputer Control Unit 



NEXT 
ADDRESS 




LOAD NEXT ADDRESS 
ON RISING EDGE OF 
CLOCK SIGNAL 



_n_ 



CLOCK 



TIMING CONTROL 
SIGNALS TO SYSTEM 
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MICROPROGRAMMED CCU: 



• CCU memory, usually programmable read-only memory (PROM), 
contains a sequence of "microinstructions" 



« Each microinstruction contains two parts: 

- microinstruction sequencer portion contains CCU 
memory address of next word 

- controller portion contains control bits 
for system 



Advantages 

t Design now becomes a programming effort (software 
engineering) 

9 Development time shortened with appropriate tools 

• Major documentation contained in program listings 

• Changes may require little or no redesign 

• Part count small (mainly memory) 

e Modular, structured techniques can be easily applied 
Testing and debugging are easier 

Disadvantages 

• May be slower than hardwired CCU 
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WHY HICROPROGRAmiNG IS BETTER 



• More structured organization 

- random hardware logic is replaced by zeros and ones 
in a memory (PROM) 



• Field changes are easy - PROM replacement 



• Adaptations are easy (extendability) - additional PROMs 



• System definition can be expanded - additional chips & PROMS 



• Documentation and service are easier (understandability) 



- structured, modular microcode instead of possible 
unstructured schematics and wire lists 
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UU4GUAGE INTERREUTIONSHIPS 

It is helpful to develop a more detailed understanding of 
where microprogramming fits in relation to "conventional" 
levels of programming. 

• High Level Languages (HLL) - Basic, FORTRAN, Pascal, ADA, etc. 

- expressed in pseudo-math (Z=X+Y) 

- converted to machine language (ML) by ccwnpiler/ interpreter 

- each HLL statement translates into many ML statements 

- user is largely isolated from the particular hardware system 

- fixed instruction set (FIS) 

9 Assanbly Language 

- expressed in mnemonics (ADD Rl, R2) 

- converted to machine language by assembler 

- ratio to machine language statements is usually 1:1 

- user no longer isolated from knowledge of system hardware 

- fixed instruction set (operations and format) 
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e Machine Language 

- expressed In binary code (01101110) 

- each machine language instruction interpreted by a 
microprogram routine 

- fixed instruction set (operations and format) 

- knowledge of system hardware 

• Register Transfer Language (Nlcroprograirmlng) 

- direct control of hardware at register transfer level 

- must know complete system hardware 

- format of microprogram instruction statements defined 

- microprogramming often stored In PROM (firmware) 

• Boolean Language (Hardware logic) 

- logic function realization in SSI/MSI circuits 
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LANGUAGE RELATIONSHIPS 



SYSTEM DEVELOPMENT PSEUDO-ASSEMBLY 



HIGH-LEVEL 
LANGUAGE 



ASSEMBLY 
LEVEL 



MACHINE 
LEVEL 



MICROCODE 



DECREASING 

- PROGRAMMING EFFORT 

- ACCESS TO HARDWARE 



X 



CONTINUOUS SPECTRUM 
OF LANGUAGES 



INCREASING 

- PROGRAMMING EFFORT 

- ACCESS TO HARDWARE 
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COHPARING LANGUAGE IMPLEHENTATIONS 



BASIC 


ASSEMBLY 


MACHINE 


COMMENTS 




80BQA 


8080A (HEX) 




READ A, B, C 












IN CRD 


OB 


05 


INPUT FROM CARD 




MVI H, ADRH 


26 00 






MVI L, ADRL 


2E 


40 






MOV M, A 


77 




CRD -> MEM - A 




INX HL 


23 




INCR ADDRESS 




IN CRD 


08 


05 






MOV M, A 


77 




CRD - > MEM - 8 




INX HL 


23 








IN CRD 


DB 


05 






MOV M, A 


77 




CRD -> MEM - C 


LET A = A + 8 ■ 


- C 










MVE L, ADRL 


2E 


40 


RESET ADDRESS 




MOV A, M 


7E 




LOAD ACC <- A 




INX HL 


23 








ADD M 


86 




ADD ACC <- ACC + 




INX HL 


23 








SUB M 


96 




SUB ACC <- ACC - 




MVI L, ADRL 


2E 


40 


RESET ADDRESS 




MOV M,A 


77 




ACC -> MEM - A 



• Note that each Basic statement translates into 10 or so 
assembly language instructions and each assembly instruction 
translates into 1 or 2 words at the machine level. 



No attempt was made to make the assembly program efficient. 

- the intent was to translate directly from the Basic 
statements (one at a time) 
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MICROPROGRAMMING DEFINITIONS 



Microstore (control store, mlcromemory) 

- The ecu memory (often ROM or PROM) where microprograms are 
stored. 



Microprogram 

- A logically related sequence of microinstructions and/or 
microroutines. 



Microroutine 

- A sequence of one or more microinstructions which control 
a functional task (may implement one macroinstruction, for 
example). 



Microinstruction 

- The combination of all micro-operations or fields that 
specify the state of all control lines during a time 
interval (clock cyle). 



Micro-operation 

- The combination of one or more fields to control one 
functional unit, such as the ALU. 



Field 



One or more bits (binary digits) as needed to define a 
specific hardware activity for a functional unit such as 
an ALU arithmetic operation. 
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MACHINE LEVEL INSTRUCTION 



OP CODE 


DESTINATION 
R1 


SOURCE 
R2 


15 8 


7 4 


3 



MICRO-OPERATION 



MICRO-INSTRUCTION /FIELD 






A 








r ■ > 


I 


BRANCH 
ADDRESS 


Am2910 
INST 


CC 
MUX 


IR 
LO 


Am2903 
A& B 


Am2903 
SOURCE 


Am2903 
ALU 


Am2903 
DEST 


STATUS 
LOAD 


SHIFT 
MUX 


ETC 

























32 TO 128 BITS 



-H 
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INSTRUCTION REGISTER; 



MICROPROGRAM HARDWARE 



MICROROUTINE FOR A 



INSTRUCTION A —^ > 



INSTRUCTION B 



INSTRUCTION A -4 




MUX 



ALU 



*- SHIFTER 



MICROROUTINE FOR B 



Each machine instruction causes a specific microroutine to be executed. 
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MICROINSTRUCTIONS 



• The microword 1s typically \/ery wide (48-128 bits) because of 
the large number of control signals required to control 
system resources (functional units). 



• The microprogrammer and detailed hardware designer, if not the 
same person, must work as a team to define the required 
microword fields (hardware/firmware/software interface fuzzy!) 



e The microinstruction format is defined by these individuals. 



• There are no fixed rules with regard to format layout or limits 
on the number of formats permissible. Objectives should include 
ease of understanding, readability, testing, flexibility and 
extendability and the associated development of good documentation. 
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SUGGESTED PRACTICES FOR MICROINSTRUCTION FORMAHING 



• Use logical fields to increase readability. Worry about physical 
layout later. There are development tools to help in implementation. 



• Minimize the use of shared or overlapped fields (use horizontal 
format), as they reduce understandability. 



• Group fields as to the hardware functional unit micro-operations 
which they control for readability and understanding. 



• Group all micromemory next address fields at one end of the 
microword for readability. 
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DEVELOPMENT SYSTEMS 
FOR AIDING MICROPROGRAM DEVELOPMENT 



• META assembler - converts mnemonics to I's and O's. Initially 
requires a definition of microinstruction format and mnemonics 
(registers, operations). Then a microroutine (source) using 
the specified format and mnemonics is translated into I's and 
O's appropriately. 



« Microprogramming shortens the development effort considerably, 

• A development system simplifies debugging (error finding) 

- of microcoded routines 

- of hardware functional units and connections 

• Aids documentation by producing human readable code 

- "mnemonics" 



ADVANCED MICRO DEVICES d 



1-360 ED2900A 1-360 

MICROPROGRAMHEO CCU ADVANTAGES REVISITED: 

• Speeds comparable to Schottky TIL 

• Custom design at an RTL level (mnanonics versus Boolean logic) 

• Compact unit (less space) with LSI circuits 

• Changes may be "firmware" changes (1n PROMs) rather than 
physical changes 

• LSI supports a structured organization 
9 LSI has better reliability 

- approximately 80% of failures in the field are due to 
external connection failures (pins, etch) 

• Microprograrrming the control portion (CCU) allows: 

- hardware and firmware being designed in parallel 

- better documentation (structured microprogramming!) 

- development systems for microprogram development 

- development systems for prototype check-out 

• Overall better potential for better documentation 

- understandability 

• Potential for better diagnostics 

- separate switchable PROM 

- diagnostic routines on-board the control memory (PROM) 
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Stannary of Design Tradeoffs 



ITEM 



SSI/HSI 
HARDWARE 



2900 FAMILY 
FIRMWARE 



MICROPROCESSOR 
FIS MOS SOFTWARE 



architecture any desired almost any desired predesigned 



Instruction any desired any desired 

via wiring via microprogram 



predesigned 
may use software 
techniques to 
achieve desired set 



word length any desired multiples of 4 



fixed at 4,8,16,32 



execution 
speed 



100-200ns cycle times 



0.7 -5us cycle 



physical size 500 dips 50 dips 
(controller) small packages medium size 



3-6 dips 
large packages 



design time long, slow, parallel - fast software - fast 
to do correctly use aids - development systems 



docionentatlon tedious 



forced via programming techniques 



upgrades redesign 



change microprogram change software 



design cost highest 



medium 



lowest 



debug 



various aides exist - microprogramming development systems 
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IF YOU'RE GOING TO DESIGN ANY MACHINE, 
USE iNDUJiTRY STANDARD PRODUCTS 

True LSI! 

/Vn2900 family parts 
are 10 to 20 times 
as con9)1ex as 
traditional MSI 

The Am2900 family 

Is designed to be 

m i croprogrammed 

"The Am2900 family is 

the industry standard 

for bipolar LSI" 
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THE Ain2900 FAMILY ELEMENTS 

• CPUs (CCU + ALU) 

• Microprogram controners/sequencers 

• Bipolar memory (macro and micro levels) 

• Interrupt processing devices 

• Bus I/O interfaces 

• Direct memory access (DMA) devices 

• Timing/clocks 

• Macroprogram (machine languages) controllers/sequencers 
6 Multipliers 
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SOIE ELEMENTS OF Ani2900 PRODUCT FAMILY 



High speed mlcroprogrfflimable registered ALUs 

4-bit slice, 16 registers Am2901B 

Higher speed 4-bit slice, 16 registers Am2901C 

Speed selected version of 2901C Am2901C-l 

Expanded function 4-bit slice, 16 registers Am2903 

Higher speed version of Ani2903 Am2903A 

Enhancement of Am2903A, including BCD arithmetic Am29203 

16-bit microprocessor for high speed control Am29116 

Multiport, pipelined processor, 8-bit slice Am29501 



ALU auxiliary circuits 

Carry lookahead Am2902A 

Status and shift control unit for 2901, 2903, 29203 Am2904 



Register file extensions for ALUs 

16-word by 4-bit two-port register file, for 2903 Am29705 

Higher speed version of 29705, for 2903A Am29705A 

16-word by 4-bit two-port register file, for 29203 Am29707 



Microprogram sequencers 

4-bit sequencer slice Am2909A 

12-bit single-chip sequencer, for up to 4k microwords to2910 

Speed selected version of Am2910 Am2910-1 

Fastest (IMOX) version of Am2910, plus deeper stack Am2910A 

4-b1t sequencer slice, compact version of Am2909A Am2911A 

4-bit program control slice Am2930 

4-b1t program control slice, compact version of 2930 Am2932 

Interruptible sequencer, 31-deep stack, 8-b1t slice Am29112 

16-way branch control unit, for 2909A and 29nA Am29803A 

Next address control unit, for 2909A and 2911A Am29811A 
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Clocks 

Single-chip clock, microprogrammable cycle lengths 



Am2925 



Interrupt control 

Vectored priority interrupt controller, expandable 
Priority interrupt expander 



Am2914 
Am2913 



Pipeline registers 

Diagnostics register, 8 bits 
Multilevel pipeline register, 8 bits 
Multilevel pipeline register, 8 bits 



Am29818 
Am29520 
Am29521 



Registered PROMs 

Registered PROM, 512 x 8 

Registered PROM, 512 x 8 

Registered PROM, 1024 x 8 

Registered PROM, 1024 x 8 

Registered PROM, 2048 x 8 

Registered PROM, 2048 x 8 



Am27S25 
Am27S27 
Am27S35 
Am27S37 
Am27S45 
Am27S47 
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ANALYZING AND DESIGNING A 

COMPUTER CONTROL UNIT 

(CCU) 
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DEVELOPMENT OF A COMPUTER CONTROL UNIT (CCU) 



The objective of this section is to develop an understanding 
of the function and use of a process sequencer. In order to 
describe the design of a sequencer 1n a logical manner, something 
is required for the sequencer to control. While the design 
concepts are applicable to any kind of process control, examples 
of a traffic light and a coffee machine will be presented later. 
Initially, a digital computer macroinstruction sequencer process 
will be used and an associated computer control unit (CCU) 
developed. 



The drawing shows the classical Von Neumann/Babbage architecture 
(5 basic units), with a few buffer-register details. The 
arlthnetic-loglc unit (ALU) includes some "scratchpad" local 
storage registers, the memory unit includes the memory address 
register (MAR) and the program counter (PC), and the control 
unit includes the instruction register (IR). This register 
receives the next machine (macro level) Instruction to be 
executed. It 1s the function of the CCU to decode the operation 
code (OP code) portion of the IR value and generate the sequence 
of control signals needed to direct the ALU, the memory and the 
I/O portions of the system (i.e. the system resources). 
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LOGIC 
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(ALU) 



^ 
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INPUT/ OUTPUT 
UNITS 



?Y 




3> 



IR 



COMPUTER 

CONTROL 

UNIT 



(ecu) 




DATA 
CONTROL 
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DETAIL VIEW: 



• A more detailed view of this architecture shows the 
level of support provided by the AMD Am2900 family 
of parts. 



• As can be seen, all of the components of a computer 
are supported with Am2900 chips. 



t For most of this discussion the controller portion is 
emphasized which is shown on the left hand side of this 
illustration. 
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SIMPLIFIED SYSTEM: 



§ In order to initially concentrate on the sequence 
controller (CCU) the remainder of the computer is 
simplified to 

- an ALU 

- the accumulator register (ACC) 



• This architecture is defined as a single-address 
structure since the other address (the ACC) is 
implied. Thus, 

- data comes into only one side of the ALU 

- the accumulator provides the second operand 

- the result of the ALU operation is transferred 
to the accumulator 
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OP CODE 




ACCUMULATOR 
REGISTER 



DATA OUT 
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CONTROL SIGNALS: 



In order to define the control signals, assume the ALU can 
perform the functions shown on the next page. Three function 
control signals are required. Five basic types of instructions 
can be supported by the ALU, as shown. 



In addition, the ALU needs one bit to provide a 1 or for 
the carry-In. This can be provided by the microword. This 
carry-in capability can be used in incrementing a register. 
Note that in a bit slice ALU configuration the carry-out of 
one slice would be connected to the carry-in of the next. 



t Outputs from the ALU include the numerical result of the 
operation, plus various status signals. Examples include 



- carry out 

- zero 

- negative 

- overflow 
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CONTROL 








LINES 


ALU FUNCTION 






$2 $2 Sq 


CiN = 


CiN =1 







A + B 


A + B + 1 






1 


B-A-1 


B-A 






1 


A-B-1 


A-B 






1 1 


AVB "A OR B" 






1 


AAB "A AND B" 






1 1 


?^AB "NOT A AND B" 






110 


A^B "A EXOR B" 






1 1 1 


AVB "NOT (A EXOR B)" 





MACHINE INSTRUCTION SUPPORTED: 

ADD 

SUB 

OR 

AND 

EXOR 
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HICROWORO FORMAT: 

The following page shows the microword format to control 

• ALU function select 

• Carry-in 

• ACC load (Input) 

• ACC enable (output) 

• Load OP code Into IR 
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MICROWORD FORMAT 



ALU 








FUNCTION, 


ACC 


ACC 


OP CODE 


CARRY 


LOAD 


ENABLE 


LOAD 
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4- 
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1 




1 




1 






OTHERS 

AS 
NEEDED 
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SIHPLE ecu: 



Each microinstruction contains the address of the next 
microinstruction to be executed in addition to the fields 
for the necessary functional unit control signals. The 
result is a single-sequence controller (i.e. no conditional 
decisions). Any microinstruction can unconditionally "jump" 
to any other microinstruction. Usually loops are not created 
in this addressing mode. 



The micro memory in this simple example is 2" words deep, and m bits 
wide, where 



m = a + c 



microword width (m) = # address bits (a) + # control bits (c) 
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THE SIMPLEST CONTROL UNIT 




NEXT 
ADDRESS 



n - bits 



LOAD NEXT ADDRESS 
ON RISING EDGE OF 
CLOCK SIGNAL 



MICROMEMORY 
ADDRESS 



PROM 



2" X (n + c) 



2 words 



c bits 



\7 



TIMING CONTROL 
SIGNALS TO SYSTEM 



CLOCK 
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TIMING DIAGRAMS 



t Now, consider designing at the logic level using timing 
diagrams that define the desired control signal operation. 
Specifically consider their binary value based upon a 
periodic interval (clock). 



use the rising edge of the clock as a measurement point 



the bit pattern formed by the time slice is defined as 
the microword 



• The following three pages present 



a timing diagram for a four-signal system 



- the timing diagram digitized on the clock edge 



the resulting program flow and the clocked microprogram 
that would generate the desired timing diagram 
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MICROPROGRAM 
MEMORY ADDRESS 


MICROPROGRAM 
MEMORY OUTPUTS 


A 


B 


C 


D 





1 


1 







1 





1 







2 













3 










1 


4 










1 


5 


1 








1 


6 


1 








1 



MICROPROGRAM 
FLOW 




1 
2 
3 
4 
5 
6 



This is the microcode for sequential execution. 
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CLASS EXERCISE 



Turn to the E02900A Exercise and Laboratory Manual 



Solve the simple traffic light problem by designing at the Boolean level 
using a state diagram to define the sequenced transitions between each 
desired light condition. The associated state code of zeros and ones is 
then used to define the microroutine. This problem could also be solved 
at the waveform level by initially defining the desired transitions in 
terms of zero-one transitions for each control signal. 
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ADDING PROGRAM FLOW CONTROL TO CCU 
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PROGRAM FLOW CONTROL ADDITION (conditional branches) 



Required microprograni flow should have the same characteris- 
tics as any computer program. I.e. sequence (continue), 
iteration (loop) and decision (branch) 1n order to implement 
an algorithm. The previous design permitted only sequential 
flow (a single sequence of microinstructions). Thus, the 
current CCU structure must be expanded to provide for these 
additional capabilities. 



The controller just described can execute one serial sequence 
of operations. In order to select from multiple sequences 
and to allow conditional branching, further addressing hard- 
ware is necessary. The current CCU configuration will be 
enhanced with additional hardware to provide this capability. 



A means must be provided to select from two microaddress 
sources. Thus, a tri-state bus is used. Since only one 
source may be actively connected to this bus at any time, 
each source requires an enabling signal to allow it to be 
selectively enabled and disabled. 
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Application of Tri state Gates 
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The Tri state Gate Syrabol 
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Only Bit is illustrated, all other bits would be attached similar^ 
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PROGRAM FLOM CONTROL (Cont'd) 



A "load counter" signal allows the counter to be loaded from 
one of these tr1 -state sources or to simply be incremented. 



The block labeled "logic" decodes a 2-bit value from the 
microword "next address select field" to generate these 
three control signals. An alternate approach would be to 
provide three separate bits in the microword for these three 
signals. 



The multiplexer (MUX) and polarity circuits provide the test 
signals for conditional jumps, and will be developed in more 
detail later. Likewise, the instruction register (IR) and 
its associated mapping PROM, which allow the introduction of 
new micro-addresses, will be developed later. 



Note that in fonnatting the microword, the microinstruction 
next microaddress sequence fields are grouped to the left, as 
previously suggested, in order to provide more structure and 
readability in the code. Grouping in any manner provides for 
understandability. 
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HULTIPLEXER 



strobe Select 
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Block Symbol 
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01-1 




10-2 
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Output 
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General Computer Control Unit (CCU) Architecture 

Each block will now be discussed In terms of its operation 
associated with sequencing microinstructions. 



INSTRUCTION 




TO ALU 
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LINEAR SEQUENCES ("CONTINUE" microinstruction) 



In programming, quite often one instruction follows another. 
This is true of microprogramming as well. In the CCU, this 
is facilitated by using a counter register instead of the 
general register as previously shown. This counter contains 
the address of the current microinstruction, and can be 
incremented to the microaddress of a sequential flow is 
desired. 



The "next address select" field would contain the necessary 
bit pattern to disable the counter load control, allowing 
the counter to increment on the next clock pulse. Since 
three control signals must be generated, two bits would be 
needed for this encoded field. Whatever the actual bit 
pattern, the mnemonic "CONT" Is assigned for a "continue" 
microinstruction. The other fields of the microword are 
not used in this mode, and are mnemonically represented as 
"X" for "don't care". 



For example: 

MICROINSTRUCTION SEQUENCER MICROOPERATIONS 
FLOW NEXT COND BRANCH CONTROL 



o 





NEXT 




CONO 


BRANCH 


AODR 


POL 


SEL 


ADDR 


CONT 


X 


XXX 


XXXX 


CONT 


X 


XXX 


xxxx 


CONT 


X 


. XXX 


XXXX 
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MULTIPLE SEQUENCES (JMAP) 



The controller can still execute only one sequence with the 
mnemonic "CONT". 



In order to execute multiple sequences, the ability to exit 
the current sequence is required and a new starting address 
from some storage location must be provided, i.e. a jump 
(conditional or unconditional) capability. 



The input to the counter can be used for this purpose (a 
jump address). Various sources are examined as sources for 
this address. 



First consider the interpretation of a new macro level 
instruction. Once the counter is loaded with a new 
microroutine starting address, each microinstruction in 
this microroutine sequence could have a "CONT" in the next 
address select field, except possibly for the last one. 

The microinstruction would also contain one bit fields to 

- enable the counter load control for external 
data (address) input 

- enable the tri-state output signal of the mapping 
PROM which is driven from the macroinstruction 
register (op-code field). 



The mnemonic "JMAP" is used to represent this "jump via 
the mapping PROM". 
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MICROMEMORY ADDRESS SOURCE 



Consider now the new microaddress source for the counter 
in more detail. 



In a digital computer, the starting micro-address is 
dependent upon the current machine (macro) instruction. 



In a controller with no macro level instructions, the 
starting micro-address is dependent upon the current 
external "command" which must supply a micro-address. 



The computer control unit (CCU) is used as an example, but 
the design approach is common to both. The CCU accepts 
either a control command or a machine instruction (OP code) 
as directly or indirectly defining a macro-address which 
lends to a sequence of microinstructions. 



• Thus, to be able to control which microroutine is to be 
executed based upon a macro instruction 

- Add a macroinstruction register (IR) 

- Add the IR "load control" bit to the microword format 

- Gate the opcode portion of the macroinstruction to 
the counter as the starting address. 



Note: A PROM mapper is not used in this simple case. Thus the 
number of opcode bits cannot exceed the microprogram address 
width. If it equals the microaddress width, there can only be 
one microword per macroinstruction (assuming unique opcodes). 
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OP-CODE MAPPING PROBLEM - There are typically fewer bits in 
the opcode than in the microaddress for example, 
let there be 



where 



X bit opcode and n bit counter 



X < n 



SOLUTION 



One approach is to input on the remaining least 
significant microaddress lines: 



OPCODE 



-^ 



■f n- 



COUNTER 



1 



n 



ADDRESS 



START ADR - 

START ADR - 

START ADR- I 

I 

START ADR ~^ 



2^"^ WORDS 



EXAMPLE 



X = 8 

n = 12 

n-x = 4 



This permits 16 microwords (2^=16) per sequence or microroutine. 
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Examine the micromemory: 

START ADDRESS: 
START ADDRESS: 
START ADDRESS: 
START ADDRESS: 
START ADDRESS: 
START ADDRESS: 




SS33XS33 



» 




^tsssss 



16 MICROWORDS 
< 16 MICROWORDS 

> 16 MICROWORDS 



PROBLEM 



« What about microroutlnes of less than 16 mlcrowords? 

- Fragmented control memory 

• What about microroutlnes of more than 16 microwords? 

- Lose starting address and its associated macro OP code 



SOLUTION 



e Add a micromemory address decoded (mapper) 
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OP CODE 



MICROPROGRAM MEMORY 



4 BITS 



15 WORDS, 
B BITS WIDE 



MAPPING 
PROM 



START ADDRESS 

START ADDRESS 
START ADDRESS 

START ADDRESS 



START ADDRESSES 
GATED THRU COUNTER 

















VARIABLE 
LENGTH 







256 WORDS, 32-128 BITS WIDE 

ANY 16 OF THE 256 

LOCATIONS CAN BE USED 

AS THE START ADDRESS 



ROM/PROM 
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FURTHER SUGGESTIONS: 



Use a larger mapping PROM to provide for privileged macro 
instruction operation or detection by adding address lines 
driven by the console switches or the PSW (processor status 
word ~ usually ACC value plus ALU status bits). 



• Privileged instructions without the privileged bit set, map 
into a common "trap" microroutine. 



Provide for more addressing capability than is needed in 
the initial design. 



Provide for expansion in either of these directions in the 
initial design. 
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OP CODE 



MICROPROGRAM MEMORY 



4 BITS 



PRIVILEGED 
STATE SELECT 
FOR TESTING 



64 WORDS, 
8 BITS WIDE 



MAPPING 
PROM 



START ADDRESS 

START ADDRESS 
START ADDRESS 

START ADDRESS 



START ADDRESSES 
GATED THRU COUNTER 
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• 



• 



MICROPROGRAM CONTROL REVISITED 



Structuring of the microprogram can be accompli shed with the 
same conceptual program structures which exist for high level 
languages. A more extensive list based upon sequence, branch 
and iteration is: 



CONT (sequence) 

GO-TO (unconditional branch or juip) 

IF-THEN-ELSE (conditional branch) 

IF-THEN (conditional branch) 

DO X (Iteration) 

DO UNTIL P = TRUE or DO WHILE P = FALSE (Iteration) 

On X GO-TO (case statements/conditional branch) 



These various control flow operations are now presented for 
the previous microsequencer architecture in more detail. 
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UNCONDITIONAL JUMP (JP) 



• In order to jump to another microaddress from the middle of 
a linear sequence, a new address is again required. The 
input to the counter will be used, but this time the new 
address will come from the current microinstruction. 



• The next address select field would carry a bit pattern to 

- enable the counter load control 

- enable the tri -state gates from the microword 

branch address field 

• The mnemonic "JP" is used for this next address operation 



For example 
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ADDR 
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EXAMPLE - JP 



NEXT ADDRESS SELECT 



PROM 
ADDRESS 



FORWARD 
BRANCH 



1 
13 

14 







e 



1 



START: 



50 



51 



52 



BRANCH ADDRESS 



CONT 


= 


m 


JMAP 


= 


01 


JP 


= 


10 










X 







X 







X 
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90 








X 







X 

> 




1 


13 

„ > 


\ 




START 
NEXT OP 



SEQUENTIAL 
EXECUTION 




BACKWARD 
BRANCH 



THESE BITS ARE "DONT CARE" 
FOR THIS OP CODE 

FOR THIS OP CODE THEY ARE 
AN ADDRESS 
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EXPLANATION: 

50) Start address of routine 

50 is an address in the PROM mapping 
Continue to 51 

51) Continue to 52 

52) Continue to 53 

53) Go to 90 (jump to 90) - JP 

- The branch address is selected to be active 
and loaded into the counter 

- Note how both fields participate 

90) Continue to 91 

91) Continue to 92 

92) Go to 13 

13) Continue to 14 

14) Go to next sequence start address - JMAP 

- Note that the branch address field values 
are don't care 
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MICROPROGRAM RETURN FLOW CONTROL 



In a ecu microprogram it Is usually required to return to a 
common (shared) micro instruction sequence before Jumping to 
the next microroutine: This is required in order to get the 
next macro instruction from main memory, thus the following 
steps are required: 



- microaddress 13 might be the macro instruction fetch step 



microaddress 14 would be the op-code decode step to control 
a microaddress 



COMMON 
CODE 



CONT 
JMAP 



1AF ^^ L i 4 



13 INSTRUCTION FETCH 

14 DECODE STEP 




OTHER SEOUENCES 



POSSIBLE SHARED END 
OF SEQUENCE STEPS 
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CONDITIONAL JUNPS OR BRANCHES: 



• During execution of certain opcodes, it is often desirable to 
end a microroutine dependent upon the result of a logic test. 
For example, a check made on a hardware status line. 



For example, Add two numbers and check for 

- overflow error - do one microinstruction sequence 



no overflow error - do a different sequence of 

microinstructions 



or. Add two numbers and do 



on carry-out = 1; one microroutine 



on carry-out = 0; a different routine 
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OTHER TESTABLE CONDITIONS MAY INCLUDE: 



logica l 
expression 



mnemonic 



ACC 


= 





ZERO 


ACC 


> 





SIGN 


OVERFLOW 




OVR 


CARRY 


= 


1 


COUT 


A 


> 


B 


GTR 


A 


< 


B 


LESS 



interrupt request IR 

error status bit set ES 
Invalid instruction bit set II 



A specific control flow example is shown in the figure where 
if the condition Is true, the CJP next address selection will 
be microaddress 85. If the condition is false, the next 
microaddress is 54. 
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CJP (address) 



50 



51 



52 
CJP 53 (^ 



:. CONDITION TRUE 



54 



55 



56 



CONDITION 
T FALSE 



85 



86 



87 



*> 88 



t 
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CONDITIONAL JUMP (CJP) 

In this instruction the micro-address Is also provided from 
the microinstruction branch address field (same as JP). The 
next address select field code would 

- test the condition code input 

- IF the condition code is TRUE, then 

(1) enable the counter load control 

(2) enable the tri-state gates from the 
microword branch address field 

- ELSE (condition code FALSE) 

(1) disable the counter load control 
The mnemonic "CJP" is used. 



In order to allow testing one of several available conditions 
(overflow, negative, zero, etc.) another multiplexer is used. 
To allow for testing for either TRUE or FALSE conditions, a 
polarity selector is used. Both the choice of condition and 
the choice of polarity is controlled from the microinstruction. 



Note that a constant TRUE and a constant FALSE are shown as 
inputs to the MUX. This allows an alternate way to do 
unconditional jumps with a "CJP". 
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INSTRUCTION 
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EXAMPLE OF CURRENT CONTROL FLOM OPERATIONS 



The following page provides a sample microroutine (sequence) 
which demonstrated the four microprogram control flow 
mnemonics 



- JMAP 

- CONT 

- JP 

- CJP 



• There are three fields which are Important 



(next) address select 



branch condition select Including polarity 



- (micromemory) address select 



branch (micromemory) address 



• The next address select field determines the microinstruction 
type. 
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BRANCH 

CONDITION 

SELECT 



NEXT 
ADDRESS 

SELECT 



Example - CJP 

coNT = m 

JMAP = 01 

JP = 10 

CJP = 11 



PROM 
ADDRESS: 

START: 



13 



14 



CONDITIONAL 

TEST < 
STATEMENTS 



30 
31 
32 
33 
34 



106 
107 



1 



r 



BRANCH 
ADDRESS 



X 


00 


X 




X 


10 


30 








2 


11 


56 




1 


11 


95 




X 


00 


X 




X 


00 


X 


4 


1 


11 


106 


< 







UNCONDITIONAL 
BRANCH 



TEST CONDITION 2 - FAIL 



TEST CONDITION 1 - FAIL 



TEST CONDITION 1 - TRUE 



X 


00 


X 




) 


X 


01 


X 







CONDITIONAL BRANCH 



START NEXT OP 



Si, So CHOOSE CONDITION TO BE TESTED, IF ANY. 
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EXPLANATION: 

13) CONT - first microaddress 

14) JP - unconditional jump to microaddress 30 

30) CJP - jump to microaddress 56 
if condition 2 = TRUE 



assume C2 = FALSE 



31) CJP on condition 1, "assume FALSE" 

32) CONT 

33) CONT 

34) CJP on condition 1, "assume TRUE", GO TO 
microaddress 106 

this time C-^ = true 
60 TO 106 

106) CONT 

107) JMAP - unconditional Jump 
select mapping PROM output 
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CUSS EXERCISE: MICRO-PROGRAM CONTROL 



The purpose of this exercise is to develop additional 
understanding of microprogramming architectures through 
a simple example. 

Consider the simple computer presented at the beginning of 
this section. With the control fields added, the microword 
is defined as follows: 



MICROWORD FORMAT 



NEXT 

ADDRESS 

SELECT 



POLARITY 



# of n bits 



f 

1 



BRANCH 

CONDITION 

SELECT 



3 



BRANCH 
ADDRESS 



LOAD 
IR 



^ 



ALU 

FUNCTION. 

CARRY 



1 



f 

4 



MICROINSTRUCTION 
ADDRESS SEQUENCER 



LOAD 
ACC 



• • • 



1 



MICROINSTRUCTION 
CONTROL (ALU...) 



NOTE: Another way of stating requirements Is through the use 
of a flow chart defining specific RTL sequential operations. 



For the structured flowchart on the next page, write the 
microcode for the sequencer portion of the microinstructions, 
Define mnemonics where needed. 
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( START j 



READ DATAIN-->ACC 



YES 




ACC t- ACC + DATAIN 



YES 




-<^ ACC = 



ACC «- ACC V DATAIN 



DATAOUT<— ACC 




ACC<-0 




NO 



DATAOUT *- ACC 



NO 



LOAD IR 



JUMP MAP 



This flowchart does not represent a real-world algorithm, 
but is useful as a pedagogical example. 
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ENCODING OF MNEMONICS (bit patterns are arbitrary examples) 



CONT = 00 
JMAP = 01 
JP = 10 
CJP = 11 
TRUE = 1 
FALSE = 



continue 
jump map 

unconditional jump 
conditional jump 
condition true 
condition false 



ZERO = 000; test for ALU result = 



SOLUTION 



FLOW NEXT 
IMM ADDR*) ADDR* 



COND BRANCH (REGISTER TRANSFER 
POL SEL ADDR* CONTROL LANGUAGE) 



1 


CJP 


FALSE 


ZERO 


6 


DATAIN ~ ACC 


2 


CJP 


FALSE 


ZERO 


4 


ACC — ACC + DATAIN 


3 


CONT 


X 


XXX 


XXX 


ACC — ACC V DATAIN 


4 


CONT 


X 


XXX 


XXX 


DATAOUT ~ ACC 


5 


JP 


X 


XXX 


1 


ACC - 


6 


JMAP 


X 


XXX 


XXX 


LOAD IR** 



* All addresses are micromemory addresses 
** Assume macroinstruction prefetch 
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IMPROVING ecu SPEED 
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TIMING CONSIDERATIONS 



Consider the CCU with the ALU attached as shown in the 
figure on the next page. Note that the condition code 
MUX and address logic are combined into one block. 



Note also the addition of a status register between the 
ALU and the condition code multiplexer. This allows a 
test on the result of the previous operation, and increases 
speed as will be seen later. 



In order to determine the clock period, it is necessary to 
time the signal flows from the time they leave a register 
until they are ready to be clocked into another register. 
This must be done for all such paths. The slowest register- 
to-register path determines the lower bound on clock speed 
(microcyle). 



« For example, the main path delays in the CCU itself are: 

- clock to output of the counter 

- read-access time of micromemory 

- set-up time for the counter (except for CONT) 

- in parallel with the above, time through the MUX 
and set-up time for the counter load 

*CP ~ ^CL to output "*■ ^read access ■*" ^set-up 
since t^^^ (je^ay + tget^p is shorter 

e In order to examine speed improvements in the CCU, consider 
the timing paths including the ALU. 
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ADD THE ALU 
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TIMING CONFUTATION 



• The timing for this implementation is computed by examining 
all sequential paths. Two of these are of interest in 
developing our CCU: 



First: 

1. Clock to output of counter 15ns 

2. Fetch instruction 50 ns 

3. ALU to status line 95 ns 

4. Status register set-up 5 ns 

Total = 165ns 
And second, in parallel: 

Steps 1. and 2. 65 ns 

3a. ALU instr to output 120 ns 
4a. ACC set-up 5 ns 

Total = 190 ns 



The minimum microcycle required is the time of the longest 
path Cp = CNTR(15) + MEM0RY{50) + ALU(120) + ACC(5) = 190 ns 



ADVANCED MICRO DEVICES Z\ 



ALU 



DATA IN 



I 

m 
D 



ft 
m 

(A 




ecu 



»cc 



CONDirrON 2 



CONDITION 1 



GROUND 



MUX 



COND. 



INSTRUCTION REGISTER 



MUX 



LOAD COUNTER 



MICROPROGRAM 
MEMORY 



ADDRESS 
SELECT 



BRANCH 
ADDRESS 



PIPELINE REGISTER 



CLOCK 



OTHER 



CLOCK 



CONTROL 
SIGNALS 



I 

<£> 

O 



m 



I 

o 



1-1000 



ED2900A 



1-1000 



CONTINUING EVOLUTION OF SEQUENCER 



A fairly powerful sequencer has evolved in terms of the 
instruction set (next address selection) it can support. 
However, speed is another criteria. Sane additional 
improvements can be made to increase speed of operation. 



For this development, the execution of a conditional branch 
is analyzed, both with the branch taken and with the branch 
not taken. 



CONDITIONAL BRANCH (CJP) 



1+10- 



1+2 



b 
b+1 



Note that although several things seem to take place 
"simultaneously" during a single microcycle, some of 
them actually occur sequentially within a microcycle 
due to asynchronous nature (non-clocked logic delays) 
of the hardware. 



Note also that there is no difference in flow when the 
branch is taken as shown in the BRANCH TAKEN diagram. 
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Branch Taken 
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RESULT OF 
M-INSTb 



Branch on result of previous instruction. 
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(Current Architecture with Branch Taken 
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PROBLEM HITH NONPARALLEL USE OF FUNCTIONAL UNITS 

• Memory fetch idle during ALU execute 

• ALU idle during memory fetch 

• Wide or long micro-cycle (relatively slow) 



A SOLUTION 



Add a pipeline register (buffer) at the output of 
the ROM (PROM). The pipeline register then buffers 
the "flow" of data in the logic (pipe) so that 
independent functional units can act in parallel 
(concurrent operation) for reduced microcycle timing. 



A two-level pipeline results in the current design 
with: 



1) counter register 

2) pipeline register 
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Pipeline Concept 
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Branch Taken 
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ADDITIONAL ARCHITECTURAL IMPROVEMENTS 



Further improvement can be made by moving the counter out of 
the path of the branch address, and replacing it with a 
combinatorial logic incrementer and a microprogram counter 
register (uPC). The incrementer generates the next sequential 
address during the clock cycle with only a gate delay. 



• A multiplexor is added to allow either the micro PC register 
or the tri -state bus to be selected as the address source to 
the micro memory. 



• Note that the tri -state output on the pipeline Is for the 
branch address field only. 



This architectural change eliminates the problem of a lost 
cycle when the branch Is taken and allows the controller to 
run at full speed all the time as shown in the following 
diagrams: 
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Pipeline Concept with Incrementer 



Incrementer 
uPC 



Micro- 
memory 
Fetch 



Pipeline 
Register 



ALU 
Execution 



Status 
Register 







I 






i + 1 


CP 


i 






Addr 
i 


















p-^(MUX)|f- 


■J 






j. 






Instr 
i 




> 


» 




Instr 
i - 1 






1 

1 


' 




Instr 
i - 1 




1 


■ 




Results 




1 ■ 


" 


'- 









ADVANCED MICRO DEVICES Zi 



(revised) 



O 



PC HEGISTEn 



INCREMEHTER 



I 

ft 

m 
ft 

ft 
m 




CLOCK 



b + 1 



branch address 
= b 



CLOCK 



<i + 1> 



CONTROL 
SIGNALS 



o 
re 



o 



TIMING 



1-1160 



ED2900A 



1-1160 



Pipeline with Incrementer - Branch Taken 



Incrementer 
uPC 



Micro- 
memory 
Fetch 



Pipeline 
Register 



i + 1 



Addr 
1 



CP 



xnstr 
i 



T 



Instr 
i - 1 I* 



HLU 

Execution 


Instr 
i - 1 




> 


* 


Status 
Register 


Resi 
i - 


Jits 
- 2 



^ 



Addr 
I + IK 






.-J 



Instr 

i + KCJP) 



Instr 
i 



Instr 
i 



Results 
i - 1 



CP 



Incrementer 
uPC 



Status 
Register 



{ i-t-3 or 't>+\ 



Addr 
i + 2 



CP 



Micro- 
memory 
Fetch 




Instr 
i * 2 
or b 








s 


» 




Pipeline 
Register 




Instr 

i + KCJP) 






1 








1 


' 




ALU 
Execution 


In£ 
i H 


;tr 
h 1 





Resul ts 
i 



b + 2 



Adcr 
b -f 1^ 



CP 



Instr 
b + 1 



Instr 
b K 



Instr 
b 



Results 
i + 1 



ADVANCED MICRO DEVICES H 



1-1170 



ED2900A 



1-1170 



No Branch 



CLOCK 



INCREMENTER 



ju CYCLE 



u L_r 



M PC REG 



MEMORY 



PIPELINE REG 



ALU 



ACCUMULATOR 



AilNSTi+ 1 
ADR 



julNSTi 
ADR 

FETCH 
ju INST i 

^ilNSTi- 1 



EXECUTE 
H INSTi- 1 



RESULT OF 
M INST I- 2 



H INSTi + 2 
ADR 

julNSTi + l 
ADR 

I 

FETCH 

ju INSTi+ 1. 

M INST i 



EXECUTE 
/i INST I 



RESULT OF 
/i INSTi- 1 



ju INST i + 3 
ADR 

At INST 1 + 2 
ADR 

FETCH 

M INSTi + 2 

/ilNSTi+l 



EXECUTE 
MlNSTi+ 1, 



RESULT OF 
M INSTi 



M INST 1 + 4 
ADR 

M INST I + 3. 
ADR 

FETCH 

H INST i + 3 

/iiNSTi + 2 



EXECUTE 
M INST 1 + 2 



RESULT OF 
julNSTi+1 



MlNSTi + 5 
ADR 

julNSTi + 4 
ADR 

FETCH 
MlNSTi + 4 

iu INST 1 + 3 



EXECUTE 
MlNSTi + 3 



RESULT OF 
)U INSTi + 2 



Final Version Architecture 



ADVANCED MICRO DEVICES Z^ 



1-1180 



EDZ900A 



1-1180 



Branch Taken - No Penalty 



CLOCK 



INCREMENTER 



AiPCREG 



MEMORY 



PIPELINE REG 



ALU 



ACCUMULATOR 



M CYCLE 



;ilNSTi+1. 
ADR 

/I INST i ADR 
ADR 

FETCH 
/ilNSTi~1 

/ilNSTi-1 



EXECUTE 
MlNSTi-l 



RESULT OF 
MlNSTi-2 



/ulNSTi + 2 
ADR 

MlNSTi+ 1 
ADR 

FETCH 
MlNSTi+1, 

M INST i 



EXECUTE 
H INST i 



RESULT OF 
MiNSTi- 1 






plNSTb+1 
ADR 

MlNSTi + 2 
ADR 



\ 



FETCH 
H INST b 



AilNSTi + 1 



EXECUTE 
pINST i+ 1, 

(COND 
BRANCH): 

RESULT OF 
II INST i 



iulNSTb + 2 
ADR 



iu INSTb+ 1 
ADR 

FETCH 
MlNSTb+ 1 

At INST b 



EXECUTE 
/ilNSTb 



RESULT OF 
/ilNSTi+ 1 



plNSTb + 3 
ADR 

MlNSTb + 2 



FETCH 
;ilNSTb + 2 

MlNSTb+1 



EXECUTE 
/ilNSTb+1 



RESULT OF 
/i INST b 



ADVANCED MICRO DEVICES n 



1-1190 ED2900A 1-1190 



FURTHER IMPROVEMENTS IN MICROPROGRAM CONTROL 
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SUBROUTINE CONTROL FLOW (branching) 



There are cases where a branch to a routine and then a 
return to the main microprogram flow upon the routine's 
completion is desired. It may be desired to do this 
branching from several different places in the main 
program. 



Subroutine organizations, as used in other programming 
languages, provide a structured way of accomplishing this 
task. 



• The ability to perform nested subroutines is also desired, 
that is, where one subroutine can call another subroutine 
and so forth. 



• Subroutines support structured programming concepts, 
especially the implementation of modular code and 
functionality. 



To facilitate these features, the following capabilities 
are required to perform a subroutine (a branch and return 
sequence): 

- a stack to save the micromemory address 

- a top-of-stack (TOS) pointer 

- a means of accessing the top of the stack 

through another input to the micromemory address MUX 

- logic to control the stack operations 
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SUBROUTINES: 



• Subroutines should be callable from anywhere in the 
microprogram. 

As with jumps/branches, subroutine calls can be conditional 
or unconditional. 



• At the completion of the subroutine, control returns to the 
main macroprogram statement following the calling statement. 
This is an unconditional return. 



« A return can be permitted prior to the completion of the 
subroutine based on some logical condition. This would be 
by definition a conditional return. 



• Defined mnemonics are: 

CJS - conditional jump subroutine 
CRTN - conditional return 



• Assume forced TRUE conditions will be used to implement 
unconditional calls and returns. 



The "logic" will control PUSHIng the return micromemory 
address onto the stack and POPping the stack on return. 
The POP operation logically connects the value (microaddress 
on the top of the stack} to the S input on the microaddress 
MUX. 
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NESTED SUBROUTINES 



• Occur where one subroutine calls another 



The best way to handle multiple return addresses is via a 
last In, first out stack and a top of stack (TOS) pointer 
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TOS = TOS - 1 
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EXAMPLE: 

• The following microroutine demonstrates a subroutine call 
and return: 



NEXT P CONG BR 
AOOR MUX AODR 
SELECT L SEL 



STRT:31 CONT X X 
32 CJP TEST LI 



32 CONT X X 

33 JP X L2 



LI: 104 CONT X X 

105 CONT X X 

L2: 106 CJS TEST L3 

107 CONT X X 

108 JMAP X X 



L3: 547 CONT X X 

548 CONT X X 

549 CONT X X 

550 CRTN PASS X "unconditional return' 



Possible next address controls for our CCU so far: 

CONT Continue 

JP Go to branch address 

CJP If condition true then go to branch address 

JMAP Go to mapping PROM output (start address) 

CJS If condition true then go to subroutine address 

CRTN If condition true then go to <TOS> 

" ADVANCED MICRO DEVICES £1 



1-1280 ED2900A 1-1280 



LOOPS (ITERATION) 



• There are many algorithms that require one or more 
statements to be repeated for X number of times (DO loop) 

• One way to implanent a X-times loop Is via a loop starting 
address and a decrenenting counter. 



o Example - 

BEGIN LOOP: 

REGISTER < — START ADDRESS 

COUNTER <-- X - 1 note counter is 1 less 

than times loop is executed 

END LOOP: 

IF COUNTER = (3 GO TO <uPC> (leave loop and continue) 

IF COUNTER ?« GO TO <RE6ISTER> (loop again at 

START ADDRESS) 



« Note that loop's starting micromemory address could also be 

stored in the branch address field at the last microinstruction 
in the loop instead of the register (an additional required 
storage location). 



• A loop may also occur where one or more statements are 
repeated until some condition exists or event occurs 
(referred to as DO-HHILE or DO-UNTIL loops). 

IF TEST = TRUE GO TO <uPC> 

IF TEST = FALSE GO TO <REGISTER> 
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LOOPS 
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IF COUNTER / 

OR 

IF CONDITION = FALSE 
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MODIFIED SEQUENCER STRUCTURE FOR LOOP ITERATION: 



A counter was added to hold the loop count. A source is 
needed to hold the original value of the count for transfer 
to the counter. Another field in the microword could be 
added. However, an overlapped or shared field could be used. 

A shared field is a field that has one meaning for some 
operations and another meaning for other operations. Often 
an extra bit is added to the microword to indicate which 
meaning is being used, but in this case the next address 
select field does the job. 



Sharing fields {also called vertical microprograiming) 
should be used with care. However, the example under 
consideration is commonly used with Am2900 parts. 



The branch address field (which is only used during jump or 
CJS instructions) is "overlapped" with the counter value 
field. Note that the count is thus limited to n bits. 

Some type of next address select code is needed that will 
determine the location (register, microinstruction, stack) 
of loop starting address. 

Finally, an extra tri-state enable is added for flexibility 
for selecting other external microaddress values. In this 
development, it will be used for enabling Interrupt vectors - 
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SEQUENCER NEXT ADDRESS CONTROL 



• The next figure presents the logic block diagram for next 
address control. The following signals are thus defined: 



e Inputs: 

- Next address select from pipeline (microword) - assume 
4-bits will suffice 



- Condition code; - output of condition code MUX 

- OE (low); - allows all outputs to be tri -stated 

• Outputs: 

- Three output enables for tri -state sources 

MAf> - for mapping PROM 

PIPELINE - for pipeline branch address field 

VECT - extra (intended for interrupt vectors) 

- MUX select for control of the micromemory address MUX 

- Counter load and enable for loop counter control 

- FE file enable causes a stack operation 

- PUP determines stack push or pop 
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Summary of Next Address Control 
Logic Block 
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ecu IMPLEMENTATIONS 
USING Am2900/Am29100 FAMILY PARTS 



ADVANCED MICRO DEVICES t^ 



1-1360 E02900A 1-1360 



MICROSEQUENCER SELECTION 



• There dre three choices of Am2900 chip sets available for 
implementing a control unit. 



• The first consists of the Am2910 microprogram controller. 



e The second is the Am29112 microprogram controller. 



» The third consists of the Am29811 next address control unit 
with either the Am2909 or Am2911 microprogram sequencer 
(bit slice). 
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PRIMARY DIFFERENCES BETWEEN APPROACHES 
ARI2910 



• The Am2910 Is a single package, containing sequencer, next 
address control logic, and a combined counter/register. 



• The Am2910 is not a bit-slice, but has a 12-bit micromemory 
address output (4K micromemory addressing). 



The Am2910 includes vector-enable output 



AIII29112 



The Am29112 is similar to the Am2910 in general structure, 
but is an 8-bit slice expandable to two for addressing 64K 
of micromemory. 



The Am29112 stack is 33 registers deep. 



The Am29112 also features direct, multiway, relative and 
program-counter-relative addressing modes, along with vectored 
interrupts. 



(The Ain2910 will be emphasized with possible alternate 
capabilities discussed with the Am29811 and the Am2909/2911 
and the Ain29112) 
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Am2909/2911 SEQUENCERS 



The Am2909/2911 is a 4-bit sequencer slice, allowing any 
width of microprogramming addressing and requires next 
address control logic. 



The Am2909 has four Input bits OR'ed with its output for use 
with the Am29803 for doing 16-way branches (case statement). 



The Am29811 next address control logic has the same 
instruction set as the toi2910 except for the Am2910's 
three-way-branch . 
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Aai29112 In a Single Pipelined System 
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flm2910 DISTINCTIVE CHARACTERISTICS 



• Twelve bit address output 

• Four address sources - 0, R, File (Stack output), uPC 
e Internal loop counter 

• Five deep subroutine stack - Am2910, nine deep - Am2910A 
» Conditional test input 

o Sixteen powerful microinstructions 

• OE for three next address jump sources 
e Fast microprogram execution 



• Additional control pins 

(discussed in detail later) 



RLD - register latch 

CCEN - for forced pass 

CI - for inhibiting incrementer 
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Am2910 INSTRUCTION SET SlftWRY 
START: 

JZ Jump Zero (Reset) 

SEQUENCE: 

CONT Continue 

BRANCH: 

JMAP Jump Map 

CJP Conditional Jump to Pipeline 

CJV Conditional Jump to Vector 

JRP Conditional Jump Register or Pipeline 

CJPP Conditional Jump to Pipeline and POP Stack 
SUBROUTINE: 

CJS Conditional Jump to Subroutine (CJP and PUSH) 

JSRP Conditional Jump to Subroutine where Start Address 
is the Register or Pipeline 

CRTN Conditional Return 

LOOPING: 

LOCT Load Counter and Continue 

PUSH Push Micro-PC on Stack, Conditional Load Counter 
and Continue 

RPCT Repeat Loop if Counter = 0, Start Address on Stack 

LOOP Repeat Loop until TEST = TRUE, Start Address on Stack 

TWB Repeat Loop if TEST = FALSE and Counter = 

ELSE IF TEST = FALSE and COUNTER = 0, Go to Pipeline 

ELSE IF TEST = TRUE Continue 
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CONT Conttnue to Next Instruction In Sequence 
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OMAP Jump to Start Address (Enable Mapping PROM) 
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Figure 4-12. Conclilional jump pipeline (CJ['. 3). 
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Figure 4-13. Conditional jump vector (CJV. 6). 
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Figure 4-14. Load counierand coniinue (LDCT. C). This inslruclion must be 
executed before a loop instruction or a jump which used ihe register. 
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Figure 4-15. Conditional jump regiMer/pipeline (JRP, 7). LDCT must have been 
cxeculed somewhere ahead of JRP. 
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Figure 4-16. Condiljonal jump subrouline from pipeline (CJS, 1). 
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Figure 4-17. Condilional jump subroutine register/pipeline (JSRP, 5>. LDCT or a 
register load must occur somewhere prior lo JSRP. 
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Figure 4-18. Conditional return (CRTN, A). 
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Figure 4-19. Repeal pipeline if counter jt 0(RPCT, 9). (Loop on one or more 
sialemcnts, beginning address of loop in pipeline |al RPCT statement].) 
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Figure 4-20. Push Slack and conditional load counter (PUSH, 4). This instruction 
must immediately precede the first statement in a loop controlled by LOOP or RFCT, 
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Figure 4-21. Repeal loop from stack if counter /= OtRFCT, 8). 
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Figure 4-22. Test end of loop (LOOP, D). Musi be preceding first statement in loop. 



ADVANCED MICRO DEVICES H 



1-1750 



ED2900A 



1-1750 



LOOP 



MRRy 
OVR 
ZERO 
SIGN 
INTR . 
ETC 
ETC 



r 




ADVANCED MICRO DEVICES iJ 



1-1760 



ED2900A 



1-1760 



CJPP Conditional Jump to Pipeline and POP TOS 

(Use to exU from a loop which uses the stack) 



ADDRESS 
(HEX) 



2910 


COND 


BRANCH 


INSTR 


MUX 


ADDRESS 


PUSH 


FAIL 


# 


CONT 


# 


# 


CONT 


# 


# 


CJPP 


TESTJ 


ADRJ 


CJPP 


TESTK 


ADRK 


LOOP 


TESTL 


# 


CONT 


# 


# 



50 
51 
52 
53 
54 
55 
56 



80 ADRK: CONT # # 

81 CONT # # 

82 CONT # # 



90 
91 
92 



ADRJ: 



CONT 


# 


# 


CONT 


# 


# 


CONT 


# 


# 



PUSH 50 
CONT 51 
CONT 52 I 
CJPP 53 J 
CJPP 54 i- 
LOOP 55 (1-- 
CONT 56 i FAIL 



^•- 



PASS 



80 t 91 
92 



H— -i-* 80 1 1 
1—1 i 81 n 
1 FAIL f 82 



cc 


COUNTER = 
LINE 


STACK 


ADDRESS 
SOURCE 


REGISTER/ 
COUNTER 


OE 


PASS 
FAIL 


X 


POP 

NC 


D 

^PC 


NC 


PL 



Figure 4-23. Condilional jump pipeline and POP (CJPP, B). 
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Figure 4-24. Three-way branch (TWB. F). 
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POMERFUL THREE-WAY BRANCHING 



IF 
CONDITION 



■STTT^CK 




IF 

COUNTER = 



YES 



CONTINUE 
POP STACK 



CONTINUE 
POP STACK 
DECR COUNTER 



FAIL 




IF 

COUNTER = 



GO TO BRANCH 
( PIPELINE ) 
POP STACK 



GO TO BRANCH 

( STACK REFERENCE ) 

DECR COUNTER 
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EXAMPLE OF THREE-WAY BRANCH 



PUSH 63 o 




CONT 64 



CONT 65 



CONT 66 



TWB 67 (i ^ 



gA PUSH START ADDRESS 
OF ROUTINE ON STACK 



N^ LOAD LENGTH OF MEMORY 
TO BE SEARCHED 



(FETCH NEXT 
OPERAND; 
COMPARE TO 
KEY; ETC.) 



NO MATCH 



CONT 68 t 

MATCH FOUND 



COUNTER ^ 0; DECREMENT 



COUNTER = 



78 CONT 



79 CONT 



NO MATCH 
WITHIN MEMORY 
SECTION 
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Special Pins on Ani2910 



WOB Register Load 

• For the basic Instruction set, RED is held high 

• For causing the register to load on the f clock 
transition, regardless of the instruction, RLD is 
pulled low — whatever is on the bus is loaded 
into the register 



Ijjl Carry In 

• For normal operation, C^^ is held high 

To repeat an instruction, Cjfj is driven low 
( not normally under pipeline control or you 
may have an infinite loop!) 
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m 



Special Pins on Ain2910 



CUEH Condition Code Enable 



t CCEN = LOW; enables CO (TEST) input to 
operate normally 



« CCEN = HIGH; all conditional instructions 
are unconditionally true (TEST = PASS) 



Tri -state control of Yj outputs 



FULL Five items are on stack; use in diagnostic 
test programs; debug 
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NEXT ADDRESS CONTROL 
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MICROPROGRAMMED SEQUENCERS 
Am2909 / Ain2911 
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Using the Am29811A with the An2g09A/An2911A 



• Bit-slice architecture means more microword addresses due to 
more address lines, hence larger microprograms. (Sequencer 
width independent of ALU width.) 



• ORed outputs on Am2909A allows use of Am29803A for 16-way 
branch. 



• Separate register (Ri) and direct (01) Inputs on Am2909A for 
flexibility. 



• Am2909A and Am2911A speeds are comparable to Am2910. 
(See Data Book) 



e Could replace Am298nA with ROM for customer instruction set. 
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Ain29811 



JUMP ZERO (JZi 




I CONO JSB PI i;JS) 



SO 
SI 
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SS 






Z JUMP MAP IJMAP) 



SO I t 

tl " 
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M 
■I 



3 CONO JUMP PL ICJPI 
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51 
S7 

U " 
S4 I i 



4 PUSH/CONO LD CNTR IPUSHI 



5 CONO JSB R/PL IJ5RPI 



31 




e CONO JUMP VECTOR (CJVI 




7 CONO JUMP R/PL (JRPI 



SI 

SI 'I 



^r^ 




8 REPEAT LOOP, CNTR « (HFCTI 



9 REPEAT PL, CNTR yOlRPCTI 
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SI 
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SUMMARY OF 
NEXT ADDRESS CONTROL 
LOGIC BLOCK 



NEXT ADDR SELECT 
^ > 



COND MUX OUTPUT 



Am29811A 
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H 
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PL 



MUX SELECT 

FT - FILE ENABLE 
(STACK) 
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COUNTER LOAD 
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Ml croprograym Sequencer Block Diagram 
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2911 
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loads reg 
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OE 
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JP 
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OR^ input 
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needs Ani29811 self contained self contained 
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Aiii29803A 



There 1s another statement that can be used in structured 
code 



THE CASE STATEMENT 



An N-way conditional branch 



Used for choosing 1 of n paths based on one or more test 
results. 



9 For the Am29803A, 1 of 16 branches can be selected. 
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16-Way Branch 
(T3, T2, Tj, Tq) 
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Advantages of the Ain29803A 



• Allows any combination of up to four tests 
(16-way branch) to be decoded in two 
itiicrocycles. 



• faster than a series of conditional jumps 
and tests written in microcode. 



e Easier for microprogranming. 



ADVANCED MICRO DEVICES P 



1-1950 



ED2900A 



1-1950 



Am29803A 



FUNCTION TABLE 
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L 






L 




Ten To 


t 


L 


L 


H 


X 


X 


X 


H 
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A Typical CCU using the Am2909, Am2911, /to29803A and ytoi29811A 
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EXAMPLE 



• Show the microcode (partial width only) to program these 
statements, assuming an Am2903-Am2909/ll-Am29811 CCU. 



IF A THEN ON (T2T0) GO TO (10, 200, 30, 40)* 

ELSE ON (T3T1) GO TO (20, 200, 10, 20) 
IF 8 THEN ON (T3T2T1) 60 TO (10, 20, 30, 40,..) 

ELSE ON (T2T1T0) 60 TO (100, 200, 300,...) 

Where: 

A and 8 are condition multiplexer input lines. 

T3, T2, Til, TO are test inputs to the Am29803. 

10, 20, 200, etc. are labels of statements. 

The same label means the same statement. 

The statements may be considered to be the beginning of 
a microroutine of unknown length. 

* IF A IS TRUE, THEN 

IF (T2T0 = 00) GO TO 10 

IF (T2T0 = 01) 60 TO 200 

IF (T2T0 = 10) GO TO 30 

IF (T2T0 =11) 60 TO 40 
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Ain29803 SOLUTION 



1. LABEL/ 29811 MUX BR ADDR 29803 
ADDR INSTR SEL INSTR 



CJP A i + 2 NO TEST 



i + 1 


JP 


# 


350* 


hh 


i + 2 


JP 


# 


360* 


h^ 


J 


CJP 
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J + 2 


NO TEST 


3 + 1 


JP 


# 
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JP 
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JP 
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JP 
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JP 
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20 


NO TEST 


360 


JP 
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10 


NO TEST 


361 


JP 
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200 


NO TEST 


362 


JP 


# 


30 


NO TEST 


363 


JP 


# 


40 


NO TEST 


370 


JP 


# 


100 


NO TEST 


371 


JP 


# 


200 


NO TEST 



address must have 
as final HEX 
digit if LSS 2909 
attached to 29803 
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Am29112 
MICROPROGRAM SEQUENCER 
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Ain29112 In a Single Pipelined Systan 
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Am29112 CHARACTERISTICS 



Functional Description 



» The Ani29112 is a high performance interruptible microprogram 
controner Intended for use in very high speed microprograrmied 
machines and optimized for the new state-of-the-art ALU's and 
other processing components. 



« It has an instruction set featuring relative and multiway 
branching, a rich variety of looping constructs, and 
provision for loading and unloading the on-chip stack. 



Interrupts are accepted at the microcycle level and serviced 
in a manner completely transparent to the interrupted 
microcode. 
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DISTINCTIVE CHARACTERISTICS: 



• The Am29112 Is designed to operate 1n 10 MHz rti1 croprograimied 
systems. 



• A single Aiti29112 1s 8 bits wide and addresses up to 256 words 
of microprogram memory. Two Am29112's may be cascaded to 
directly address up to 64K of microprogram memory. 



• A 33 register deep on-chip stack Is used for subroutine 
linkage> interrupt handling and loop control. 



t Two kinds of Interrupts: maskable and unmaskable. 



• Features an 8-bit counter for loop control. When two 
Am29112s are cascaded, the counters can act as a single 
l6-b1t counter or two independent 8-bit counters. 



Features direct, multiway, multiway relative and program 
counter relative addressing. 



© Support for writable control store. 



e Hold feature - a hold pin facilitates multiple sequencer 
implementations. 
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t 



• 



Ani29112 OVERVIEU: 



The toi29112 Is designed for use 1n single-level pipelined 
systems. A typical configuration is shown on the next page. 



« Branch addresses, constants for the various registers and 
stack pointer values are supplied to the Am29112 through 
the D port which is bidirectional to allow the stack to be 
unloaded onto an external LIFO. 



The next address generated by the sequencer is output on 
the Y port and directly drives the micromemory program. 



A single register at the output of the microprogram memory 
contains the microinstruction being executed, while the 
next is being fetched. 



External conditions are applied to the CC input of the 
Am29n2 via the condition code MUX and also to the multiway 
inputs. 
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Am29112 Configuration 
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/Wii29112 OVERVIEW (cont'd): 



• A vectored, priority-interrupt controller generates a 
prioritized interrupt request (MINTR) to the Am29112, 
which acknowledges the request via the MINTA pin. Upon 
receiving the acknowledge, the priority-interrupt control 
puts out the encoded vector from the mapping PROM. The 
MINTA output of the Am29112 turns on the PROM output and 
simultaneously turns off the Y port, enabling the interrupt 
vector onto the microprogram address bus. In the Am29112, 
Internal states are automatically saved on the stack while 
the interrupt vector is transmitted through the Y port and 
incremented to form the next microprogram address. 



The emergency detect circuit generates an unmaskable 
interrupt request upon power failure or stack error. On 
receiving an unmaskable interrupt, the sequencer branches 
to the unmaskable interrupt routine; the address of this 
routine is stored on the Am29112 in the INTVECT register. 



• The internal organization of the Am29112 is shown in the 
figure. The most important control loop inside the 
sequencer consists of the CMUX, incrementer, and PC register. 
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Am29112 OVERVIEW (cont'd) 



The CMUX selects the next microprogram address based on the 
instruction and condition code inputs. The next microprogram 
address is selected from: the PC register for a continue, the 
D port for a branch, the adder for relative and multiway 
branches, the interrupt register for unmaskable Interrupts, 
the stack for subroutine returns or loop repeats, or all 
zeros for the JUMP ZERO instruction. 
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AB129112 INSTRUCTION SET 



HODE BITS 



The Am29112 is controlled by five instruction inputs, two 
mode inputs, and the condition code. In typical applications 
it is expected that the instruction inputs are driven directly 
from the pipeline, whereas the mode inputs are either perman- 
ently wired high or low to select the desired operating mode, 
or driven indirectly via external logic. (In some applications 
it might be Justifiable to drive the mode bits directly from the 
pipeline.) The two mode bits select among three operating modes: 
normal (0,0), extended (01) and forced continue (10 and 11). 
In the normal mode the entire instruction set of the Am29112 
applies. 



65 


nooe 


00 


Normal 


01 


Extended 


10 


Forced 


11 


Continue 



MODE CONTROLS 

Description 



For cascaded Am29112s, two independent 
8-bit counters 

For cascaded Am29112s, one 16-bit counter 

The Am29112 executes a continue 
instruction regardless of instruction, 
condition code, and multiway inputs. 
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Extended Hode: 



The instruction set includes the instructions that differen- 
tiate between upper and lower counters (when there are two 
cascaded Am29112s). In the normal mode, the two counters on 
cascaded Am29112s function independently. 



In the extended mode, however, the counters on cascaded 
Am29112s behave like one 16-bit counter and instructions 
that differentiate between counters degenerate into identical 
instructions. 



• The instructions of the Am29112 are classified into four 
groups: 

- branching and subroutine linkage 

- looping 

- stack and register 

- interrupt 



The sequencer has a repertoire of 40 different instructions 
In order to encode these instructions with only five 
instruction lines, the condition code is used in some 
cases to differentiate between two distinct instructions 
sharing the same opcode. 
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Opcode (I40) Condition 



/toi29112 INSTRUCTION SET 

Mnemonic Description 





1 
1 

2 

3 

4 

5 

6 

7 

B 

9 
10 
11 
11 
12* 
12* 
13" 
13" 
14 
15 
15 
16"' 
17"" 
18 
IB 
19 
20 
21 
22 
23 
24 
25 
26 
25 
27 
28 
29 
30 
31 



X 


JZ.U 


PASS 


PUSHD.P 


FAIL 


LDCMD.F 


COND 


POP.C 


COND 


CJD.C 


COND 


CJSD.C 


COND 


CJMW.C 


COND 


CJSMW.C 


COND 


CRTN.C 


COND 


PUSHPL.C 


COND 


LDLC.C 


PASS 


POPLC.P 


PASS 


RSTSP.P 


FAIL 


LDINTV.F 


PASS 


RFCTU.P 


FAIL 


RFCTL.F 


PASS 


RPCTU.P 


FAIL 


RPCTL.F 


COND 


LOOP.C 


PASS 


ENINT.P 


FAIL 


DISINT.F 


COND 


TWBL.C 


COND 


TWBU.C 


PASS 


TSTSP.P 


FAIL 


TSTMT.F 


COND 


CJDF.C 


COND 


CJSDF.C 


COND 


CJMWR.C 


COND 


CJSMWR.C 


COND 


CJPP.C 


COND 


PUSHPU.C 


COND 


UDUC.C 


PASS 


POPUC.P 


FAIL 


POPDW.F 


COND 


LDDW.C 


COND 


CJR.C 


COND 


CJRN.C 


COND 


CJSR.C 


COND 


CJSRN.C 



UNCONDITIONAL JUMP ZERO 

PUSH D (PASS) 

LOAD COMMAND REGISTER FROM D (FAIL) 

POP; CONDITIONAL STACKOUT TO D 

CONDITIONAL JUMP D 

CONDITIONAL JUMP SUBROUTINE D 

CONDITIONAL JUMP MULTIWAY D 

CONDITIONAL JUMP SUBROUTINE MULTIWAY D 

CONDITIONAL RETURN 

PUSH PC: COND LOAD LOWER COUNTER 

LOAD LOWER COUNTER; COND PUSH COUNTER 

POP TO LOWER COUNTER (PASS) 

RESET STACK POINTER (PASS) 

LOAD UNMASKABLE INTERRUPT VECTOR (FAIL) 

REPEAT LOOP, UPPER COUNTER » (PASS) 

REPEAT LOOP. LOWER COUNTER = (FAIL) 

REPEAT PIPELINE. UPPER COUNTER = (PASS) 

REPEAT PIPELINE. LOWER COUNTER - (FAIL) 

TEST END LOOP 

ENABLE INTERRUPTS (PASS) 

DISABLE INTERRUPTS (FAIL) 

THREE-WAY BRANCH, LOWER COUNTER 

THREE-WAY BRANCH. UPPER COUNTER 

TEST SP WITH D (PASS) 

JUMP D IF STACK NOT EMPTY 

COND JUMP D/STACK AND POP 

COND JUMP SUBROUTINE D/STACK AND POP 

COND JUMP MULTIWAY RELATIVE D 

COND JUMP SUBROUTINE MULTIWAY RELATIVE D 

COND JUMP PIPELINE AND POP 

PUSH PC: COND LOAD UPPER COUNTER 

LOAD UPPER COUMTER; COND PUSH COUNTER 

POP TO UPPER COUNTER (PASS) 

POP TO DISPLACEMENT WIDTH (FAIL) 

LOAD DISPLACEMENT WIDTH; COND PUSH DW 

COND JUMP D PC REL 

COND JUMP D PC REL NEGATIVE 

COND JUMP SUBROUTINE D PC REL 

COND JUMP SUBROUTINE D PC REL NEGATIVE 



'These instructions are identical In the extended mode. 
"These too. 
"•These too. 

Extensions: U - unconditional; C - conditional; P - PASS condition; F - FAIL condition. 

Note: PASS/FAIL condition can be produced as follows. P stands for polarity and I for input. 



CO 1 CCEN 


POL 


Condition 


X 1 





PASS 


X 1 1 


1 


FAIL 


1 j 1 P COND 
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Jump Zero (J2LU) 




1 Push (PUSHO.P) 



UNCONOmONM. 



M 


^ ' STACK 


M > 



1 Load Command Latch 
from (LDCMO.F) 



FORCED PASS 




COMKUND LATCH 



FORCED FAN. 



2 Pop and Unconditional Stackout 
to D (POP.C) 



3 Jymp D (CJO. C) 



4 Jump Subroutina (CJSO.C) 




cofnxnoHAE. 





5 Jump Multiway D (CJMW.Q 




6 Jump Subroutine Multiway 
O(CJSMW.C) 




CONOmONAL 



B Push PC and Conditional Load 
Lower Counter (PUSHPL.C) 



STACK 
(UNCOHDfnONAU 




LOWER COUNTER 
FASS 



7 Return (CRTN.C) 









1 








M I 


1 






21 1 


t 






»( 


y 


POP 




» 1 




■ 




2A • 








2B < 










PASS 

CONOmOHAI. 



9 Load Lower Counter and 
Conditional Push Counter (LDLC. C) 



LOWER COUNTER 
(UNCONOmONAL) 




CONDITIONAL 



COHOmONAL 



10 Pop to Lower Counter 
(POPLC.P) 




(STACK) 
LOWER COUNTER 



FORCED PASS 
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11 Reset Stack Pointer (RSTSP.P) 



(>- 



■0 



STACK COMTEK 



FORCED fAS» 



12 Repeat Loop, Lower Counter 
(RFCTL.F) 






„^-^ 


« , 


< 


STACK 


4B 1 




.,.1™% 


4C 




LOMTER COUNTER 


40 






4E 






80 , 


l>OP 




*1 


r 


-OnCEOFAC 



14 Test End Loop (LOOP.C) 




11 Load Untnaskable Interrupt 
Vector (LDINTV.F) 



16 Three-Way Branch, Lower 
Counter (TWaL-C) 




CONDmOMAt 



2C • I 
20 
2E 
»F • I 
30 



c>- 



-0 



IMTVECT 
REQISTEN 



FORCED FAn. 



13 Repeat Pipeline, Upper 
Counter (RPCTL.P) 



tt 

17 
II 

1> 

20 ' ' 

21 ' < 

22 ' i 




UPFER COUMTEN 



C^^ 



FORCED FASS 



15 Enable Interrupts (ENINT.P) 




ENABLE 

MASKABIE 

INTERRUPTS 



FORCED PASS 



17 Three-Way Branch, Upper 
Counter (TWBU.C) 


«a« 


'^_^^ ' STACK 


47 


•""^ 


4* 




4» 


1 


4B ' 

4C 


),Jfail 

PASS 


(D) 

(D»+l 
(0) + 2 




1 


(0)*» 



CONOmONAL 



12 Repeat Loop, Upper Counter 
(RFCTU.P) 



22 
2* 
24 
2S 
21 
271 
2f 




STACK 



C>d 



UPPER COUNTER 



FORCED FASS 



13 Repeat Pipeline. Lower 
Counter (RPCTL.F) 




FORCED FAtt. 



15 Disable InterrupU (OISINT.F) 




FORCED FAIL 



18 Test SP with D (TSTSP.P) 




FORCED PASS 
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18 Jump D II Slack Not Empty 
(■rSTMT.F) 




FOnCEOFAa. 



21 Conditional Jump Multiway 
HelBtlya D (CJMWR.C) 




24 Push PC and Conditional Load 
Upper Countar (PUSHPU.C) 



STACK 
UNCOMOmONAL 




urpEn couKTCH 

PASS 



CONOnOHAL 



26 Pop to Displacemant Width 
(POPOW.F) 



71 
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19 Conditional Jump O/Stack 
and Pop (CJOF.C) 




CONOnWNAI, 



22 Conditional Jump Subroutine 
Multiway Ralativ* O (CJSMWFtC) 



SMCK 




25 Load Uppar Counter and 
Conditional Puah Countar 
(LDUCq 



UFPencouKTEn 

UNCONOmOMAL 




STACK 
FASS 



CONOmONAL 



27 Load Displacement Width and 
Conditional Push DW (LDDW.C) 



DwnOTO REO 
UNCONDmOMAI. 




CONDmONAL 



20 Conditional Jump Subroutine 
D/Stack and Pop (CJSDF.C) 



^^^ 




CONOmOMAL 



23 Conditional Jump Pipeline 
andPap(CJPP) 

STACK 




FAS. PASS 

CONOmOKAL 



26 Pop to Upper Counter (POPUC.P) 



AC< 

«e 

«F t 



, . 1 ([STAClO 



UPPER COUNTEH 



FORCEOPASS ' 



2B Conditional Jump D PC Relstive 
(CJR.C) 



4B 

4C 
4D 
FAIL *t 
4F I 



C)-n 



JUMP ADDRESS IS 
CPC) ♦ O" 



D" is displacerrwnl (soe 1). 
CONOmOHAL 
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29 Conditional Jump D PC Relative 
Negative (CJHN.C) 



it " 

4A '• 
4B 

4C 'I 
4D n 



(>■ 



0" = -2 



PASS 



JUMP ADDRESS IS 
(PC) + 0"* 



FAIL 

D" = -2, should be two's complement (see 2). 

CONDITIONAL 



30 Conditional Jump Subroutine 
D PC Relative (CJSR.C) 



4A. 
4B^ 


1 


J 


4C ' 


FAIL 


40 < 


1 




" 


i 





PASS 

JUMP ADDRESS IS 
(PC) + D" 



D" Is displacement (see 1). 

CONDITIONAL 



31 Conditional Jump Subroutine 
D PC Relative Negative (CJSRN.C) 



49 
4A I 
4B 

4C I' 
40 



, *«_-^STAC 
-v-^ •'*SS 



STACK 



JUMP ADDRESS IS 
(PC) + D" 



FAIL 

-2, Should t>e two's complement (see 2). 

CONDITIONAL 



Notes: 1 . Ttie number of bits of D used as displacement is stored in DWIDTH register. The remaining high order bits are zero-extended. 
2. The number of bits of D used as displacement is stored in DWIDTH register. T>ie remaining high order bits are one-extended. 
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HOMEWORK - Am2910 



• Turn to your Am2900A Exercise and Laboratory Manual. 

Find the exercises for the Am2910 and perform exercises 1 through 
18 inclusive. 



• For homework, do the famous Coffee Machine problem in ED2900 
Exercise and Laboratory Manual. 



DESIGN EXAMPLE: 



e Solve the advanced traffic light problem using Boolean logic 
and the state diagram design approach. See ED2900 Exercise 
and Laboratory Manual. 



EVALUATION BOARD EXCERCISE 

Read Am29203 Evaluation Board description in ED2900A 
Exercise and Laboratory manual. 

Perform (Day 2) Am2910 sequencer laboratory experiments. 
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HOMEWORK DESIGN PROBLEM: 



THE FAMOUS COFFEE MACHINE 



(See ED2900A Exercise and Laboratory Manual) 
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MICROCYCLE TIMING - Am2910 
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ecu MICROCYCLE TIMING 



• The objective is to determine the minimum clock period 

possible for a given design yielding maximum execution speed. 



Each system design is different, requiring detailed analysis. 



• Always use maximum (guaranteed, worst-case) delay times and 
set-up times from the data sheet for the specific system 
component. 



The basic technique is straightforward: 

- find all possible paths from one register to another 

- calculate the path delay time using worst-case device times 

- the longest path determines the minimum clock period 

- if necessary, look for design changes to reduce the 
the time delay on the longest path 

- alternately, use a variable-length clock to 
accommodate the longer delays when needed 



e The timing analysis approach is learned by considering 
examples for the CCU using the Am2910. In addition, a 
similar analysis would be performed for the ALU and other 
system circuits and devices. 
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MICROCYCLE TIHING (CONT'D); 



Use the AMD Data Book for all Am2900 parts. 



Data for the non-Am2900 parts Is assumed. 
(For a real design, use the data sheets!) 



For the IR, status register, and pipeline register 
assume Schottky technology. Also shown are delays 
for the mapping PROM and the microprogram PROM. 



DEVICE 

Schottky Register 

clock-to-output 

OE-to-output 

data-set-up-time 

Mapping PROM 

address-to-output 
OE-to-output 

Microprogram PROM 

address-to-output 
OE-to-output 



MIN 



TYP 



9 

13 

2 



25 
15 



30 
18 



MAX 



15 
20 



45 
20 



50 
25 
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MICROCYCLE TIMING (CONT'D): 

• The architecture to be used in these examples is the typical 
computer CCU. 



« Although the ALU is not shown, a similar timing analysis must 
be conducted for its paths for a complete design. 



• Note that the Am2922 multiplexer Includes a latch on its input 
(I) lines that makes up part of the pipeline register. This 
allows a smaller overall part count. 



• Observe from the Data Book for the Am2910 that different 
instructions have different delay times. This means that 
each involved path has to be calculated for all possible 
Instructions. 



«) The potentially huge numbers of timing paths will, in practice, 
be reduced by experience. 



c» In addition to timing path diagrams, PERT charts are employed 
to find the longest path. 
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Am2910-1 SWITCHING CHARACTERISTICS 



The tables below define the Am29l0-1 switching characteristics. Tables A are setup and hold times relative to the clock LOW-to-HIGH 
transition. Tables B are combinational delays. Tables C are clocl( requirements. All measurements are made at 1.SV with input levels at 
or 3V. All values are in ns. All outputs have maximum DC loading. 



I. GUARANTEED CHARACTERISTICS OVER COMMERCIAL OPERATING RANGE 
Am291 0-1 DC (Ta = to + 70°C. VcC = 4.75 to 5.25V, Cu = 50pF) 



A. Set-up and Hold Times 



Inpul 


ts 


-h I 


D -*H 


24 


6 


Di-* PC 


58 , 


4 


'0-'3 


75 1 





cc 


63 1 





CCEN 


63 ! 





CI 


46 1 


5 


RLD 


36 1 


6 



B. Combinational Delays 



Input 


¥ 


PL, VECT. MAP 


Full 


Do-Di, 


20 


- 


- 


h-h 


50 


51 


- 


CC 


30 


- 


- 


CCEN 


30 


- 


- 


CP (Note 2) 
1-8,9. 15 


7S 


. 


60 


65 


- 


60 


CP 

All other 1 


55 


- 


60 


of (Note 3) 


35 30 


- 


- 



C. Clock Requirements (Note 1) 



Minimum Clock LOW Time 


50 


ns 


Minimum Clock HIGi-l Time 


35 


ns 


Minimum Clock Pervx). 
1 ■> a. 9. 15 (Note 2) 


113 


ns 


123 


Minimum Clock Period. I»14 


93 i ns 



Boldface times indicate speed selected 
cntical paths. 



II. GUARANTEED CHARACTERISTICS OVER MILITARY OPERATING RANGE 
Am2910-1DM (Tc " -55 to +125°C, Vcc = 4.5 to 5.5V, Cl = 50pF) 



A. Set-up and Hold Times 



B. Combinational Delays 



Input 


«s 


'h 


Di-*R 


28 


6 


Di-*PC 


62 1 


4 


'o-ij 


81 i 





CC 


65 1 





CCEN 


63 1 





CI 


SB 1 


5 


RLD 


42 1 


6 



Input 


y 


PL VECT. MAP 1 Full 


Do-Dii 


25 


- 


- 


lo-lj 


54 


58 


- 


CC 


35 


- 


- 


CCEN 


37 ! 


- 


CP (Note 2) 
1 - 8. 9. 15 


77 1 


67 


98 


- 


67 


CP 

All other 1 


61 


- 


67 


OE iNoie 31 


4030 1 


1 



C, Clock Requirements (Note 


1) 


Minimum Clock LOW Time 


1 53 1 ns 


Minimum Clock HIGH Time 


! "2 


ns 


Minirfium Clock Penoo. 
1 - 8. 9. 15 (Note 2) 


1 "* ! n. 
1 125 i 


Minimum Clock Penoa. 1=14 


i 100 1 ns 



NOTES: 

1 . Clock periods (or instructions not soecilies a'e oetermined by external 
conditions. 

2. These instructions are conditional on the counter. Uta the shorter 
specified deiay times if the previous instruction could proouce no 



change in the counter or could only decrement the counter. Use the 
longer deia ysfr om CP to outputs if the instruction prior to the ciock was 
4 or 12 or RLD was LOW. 
3 Enable/Disable. Disable times measured to 0.5V change on output 
voltage level with C^ » S.OpF. 
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CONTINUE INSTRUCTION TIMING ANALYSIS 

• Locate "all" register-to-register timing paths. 

- start at pipeline, CP -> output 

- Am2910 I->Y, CP->Y and I->PC-setup in parallel 

- after Am2910 output is stable, add micromemory 
address -> output delay 

- finally, setup for pipeline and Am2922 

« On PERT chart, assign worst-case times to each block, 
e Add up times along each path. 



« For PERT chart, converging paths must all be satisfied, 
hence use maximum time at that point (e.g. address input 
to micromemory). 



e Maximum path defines minimum clock cycle possible. 
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CONTINUE 
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PIPELINE 
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DEVICE PATH 
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! — °C SETUP 

CP — Y 
ADDR OUT 

SET-UP 

SET-UP 
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PATH3 
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119 



116 



PATH 5 



15 


15 


-_- 


70 




— 


— 


104 


— 


... 


... 


55 


50 


— 


50 


— - 


-:- 


11 


3 


--- 


— •- 



55 
50 



110 



ADVANCED MICRO DEVICES ^ 



1-2250 



ED2900A 



1-2250 




Pipeline 
Register 
Clock - Output 
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Microprogram 

Memory 

Addr - Output 
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Pipeline 
Register 
Setup 



CONTINUE INSTRUCTION 
PERT CHART 



ADVANCED MICRO DEVICES ^ 



JUMP MAP 



I 

O 



DATA BUS 



CLOCK 



t 

ft 

m 

O 

ft 

D 

m 

:S 
ft 

m 
(A 





1 


u 


Q 








Q 








INSTRUCTION 
REGISTER ^^^ 








m 


DM ( 






^° 








■ A 




MAP 


1 Am 2910 
/ SEQt ENCER 

/ PC 

STACK 


PL 






OE 


CP 






1 












CP 


Q 





CP 




cc 










¥ 


f 










1 










i 


A 




D 


STAT\JS 
REGISTER 


Am 2922 

CONDITION 

CODE 

MUX 

t 








MICROPROGRAM 
/ MEMORY 




• 
• 


• 
• 


TEST 
















I 








/ 










\ 










*" 




D 




















D 










PIPELINE 
REGISTER,^' 






PIPELINE 
REGISTER 




CP 


OE 






Y 








Y 




V 




• 











m 
o 

INJ 



I 

ro 

O 



1-2270 



ED2900A 



1-2270 



Pipeline 
Register 
Clock - Output 



15 




IR 

Clock ~ Output 



Am2910 
li - MAP 



51 



MAP PROM 
OE - Output 



20 



15 
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Pipeline 
Register 
Clock - Output 



Am2910 
Clock - 
Output 



MAP PROM 
ADDR - Output 
















Am2910 
Di - Yi 
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Am2910 
Di - PC 
Setup 
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(CONTINUE) 
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JUMP MAP 
PERT CHART 
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SPEEDING UP THE MICROCYCLE 



• Consider a change to the architecture to speed up the 
microcycle. 



Use combinatorial SSI circuits to decode the pipeline 
enable and map enable directly from the Am2910 instruction 
inputs. 



« Although the SSI delay is small, it too could be eliminated 
by driving the map and pipeline enables directly from the 
microword. 
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COFFEE MACHINE SOLUTION 



(See ED2900A Excercise and Laboratory Manual) 
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